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TECHNICAL MEMORANDUM X-64906 


TECHNIQUES FOR TRAJECTORY OPTIMIZATION 
USING A HYBRID COMPUTER 

INTRODUCTION 


a 

Numerous applications of optimal control theory have been made to the 
control and guidance of aircraft* launch vehicles, and spacecraft. In general, 
however, optimal guidance solutions have been evaluated off-line and any on- 
board guidance updating capability has been limited to quasi-optimal schemes 
such as the Iterative Guidance Mode [ 1] . The limiting factor in any on-board 
guidance updating scheme is the computing speed available. The solution of 
optimal control problems requires repeated integration of sets of differential 
equations — a time-consuming operation. 

The proposed MASCOT system 1 2] would allow the on-board solution of 
optimal guidance problems by using new high-speed algorithms for numerical 
integration of the differential equations on a digital computer. An alternative 
approach is to use hybrid computation, with a high-speed model of the flight 
system being contained in the analog portion of the computer. This method 
will give very fast integration times but raises particular computational 
problems because of the limited accuracy and amplitude range of the analog 
variables. 

The Indirect Method [ 3] for obtaining optimal control solutions converts 
the problem into a two-point boundary- value problem. A solution can then be 
found by iteratively searching the initial condition space for the unknown initial 
condition vector of the co-state variables. This solution vector is more easily 
handled by a hybrid computer than the complete state and control vectors, 
which have to be stored when using Direct Methods [3] . 

The solution of the two-point boundary-value problem has been studied 
on an iterative analog computer [4-6] . The present research was initiated 
with the aim of implementing those techniques on a full hybrid computer and of 
studying some of the outstanding computational problems. 

The structure of this report is as follows. The Indirect Method of solu- 
tion Is formulated In the next section. The following section describes the 
automation of feedback switching [4] which is necessary to ensure convergence 
of the iteration from an arbitrary initial guess; disturbances from the nominal 



solution, which can cause the analog variables to saturate, can also be handled. 
Contour Modification was found to be helpful in the original study [5] but 
required impractical analysis to use it. A numerical approach is described in 
the section titled Contour Modification. A simulation of on-line updating [6) , 
which uses the techniques described in the previous sections, is presented in 
the section. Progressive Updating. The conclusions are given in the final 
section. 


FORMULATION 

Let the dynamic system be represented by the set of state equations, 

x(t) = _ffx(t),u(t)] ( 1) 

where x( t) is an n- vector of state variables, x.( t) , i = 1, ... n; u(t) is an 

r- vector of control variables, u.( t) , j = 1, . . . r, and _f is an n- vector of 

known functions, f , k = 1 , ... n. 
k 

The problem is to determine a control vector u*(t) (superscript * indi- 
cates an optimum quantity minimizing J) in the interval t 0 s t ^ t^ such that a 

performance index J = J [ x(t^) ] is minimized, J is considered to be of the 
form, 

h 

J - K[x( tj)) + f L [x( t) , u( t) ] dt ( 2) 

to 

so that an additional state variable must be defined by 

x 0 (t) = L[x(t),u(t)] = f 0 [x(t),u(t)] (3) 

giving x ( t) as an ( n + l) vector of state variables, x^( t) , i = 0, 1 , ... n. 
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The function u*( t) can be determined from Pontryagin’ s Maximum 
Principle by forming the Hamiltonian, 

n 

#lx( t) , p( t) , u( t) ] - J p ( t) f t(x( t) ,u( t) ] (4) 

i=0 


where p( t) is an ( n + l) vector of co- state (adjoint) variables, p.(t), i « 0, 
1, . * . n with p Q ( t) = - 1. 1 

The function p( t) is chosen such that three necessary conditions are 
satisfied [7] as follows: 

1. Canonical Equations: 


• - a# 

X j 3 Pj( t)' 

. e K 

P j "3 x.( t) ’ 


2. Boundary Conditions: 


j = 1, ... n • 


j = 1, ... n a 


( 5 ) 


Initial: 


x(t 0 ) “ (given - present state) 


( 6 ) 


final: 




( transversality condition) 


(7) 


or 


x( t f ) 


X* 


( given - target state) 


( 8 ) 
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3. Maximization of the Hamiltonians 


3ff{x( t) ,£< t) , u*( t) ] s tf[x(t) ,p(t),u(t)] 


( 9 ) 


for all t, t 0 ^ t* tj. . 

Thus, u*( t) can often be found by solving the equation, 


QX 
d u( t) 


0 


( 10 ) 


The Direct Method of solution proceeds by choosing a nominal u(t) , 
integrating the state equations forward from tf, with starting values X° , and 
storing x( t) ; then the cO-state equations are integrated backward from t^ 

with starting values p( t^) and using the stored x( t) , such that necessary 

conditions 1 and 2 are satisfied; u( t) is adjusted iteratively until condition 3 
is satisfied. 


The Indirect Method of solution uses equation ( 10) to replace u( t) as a 
function of x( t) and p( t) in the canonical equations, which are then integrated 
forward with a nominal choice of p( t 0 ) . The initial co-state vector is adjusted 
iteratively until the final conditions in condition 2 are satisfied. Thus the 
problem is converted into a classical two-point boundaiy-value problem. 

In connection with the Contour Modification technique, to be discussed 
later, an augmented performance index Jm can be formed: 


Jm 


J + 


I 

i=l 


a .e.( t ) 

i i v f 


2 


where a is an n- vector of weighting coefficients , i = 1, ... n, and £(tp 
is an n- vector of boundary errors e^ , i = 1, ... n being the errors in Satisfy- 
ing the required final conditions in condition 2. Contour Modification is con- 
cerned with the optimum selection of the weighting coefficients a . 
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FEEDBACK SWITCHING 


General Considerations 

The stabilizing effect of feeding back a function of the boundary error 
e( tp on the iterative solution of trajectory optimization problems has been 

demonstrated [4-5], although the switching point and the magnitude of each 
change were determined manually. In order to consider on-line control 
updating, the solution must be obtained completely automatically, which 
implies computer control of feedback switching. A hybrid computer is ideal 
for this purpose in that the digital computer can easily generate the logical 
decision to reduce or increase the feedback according to the progress of the 
iteration. 

The implementation of automatic feedback switching raises three basic 
questions! 

1. What should be the initial level of feedback? 

2. What fractional change in feedback should occur at each stage? 

3. What criterion should determine the switching instant? 

There is no simple answer to these questions and the various parameters must 
be determined from experience and by trial and error. This is a feature of 
any iterative method to some extent; for example, in gradient methods, step- 
size selection is always a difficult problem and stopping conditions are hard to 
define exactly. Program control of the iterative loop parameters will often be 
sufficient to ensure a convergent iteration. 

In the preceding item 1, if the nominal choice results in the overload of 
some analog variable, successive increases in feedback can be made until 
stable state and co- state trajectories are achieved. Items 2 and 3 are strongly 
interactive. The fractional change can be under program control so that a 
larger number of stages can be used when a particular problem shows itself to 
be very sensitive, or vice versa. However, if switching occurs too soon, then 
instability in the subsequent stage is likely also. 

With the object of trying to keep the overall iteration time as small as 
possible, a workable strategy is to maintain the chosen level of fractional 
change constant while trying to generate a convergent sequence of iteration 



stages by controlling the switching point. For example, if switching from one 
stage to the next occurs too soon, resulting in analog variable overload, then 
the feedback is switched back to the previous level and closer convergence 
attempted at that stage until subsequent switching results in a stable trajectory 
at the lower level of feedback. If this method breaks down, then a reduced 
fractional feedback change at each level is allowed. 

By operating this two- level form of control, it should be possible to 
achieve a sequence of iteration stages which converges to the optimum solution 
with no feedback. The tradeoff for the two control levels is between increasing 
the time spent at each iteration stage by delaying the switching point and 
increasing the number of stages by reducing the fractional feedback change, both 
factors tending to increase the overall iteration time. 

There is no precise criterion which can be used to determine the switch- 
ing point at each stage. Each subproblem has an inherent suboptimal format 
because of the inclusion of boundary-error feedback, so that the necessary con- 
ditions for optimality, which can provide a stopping condition, are no longer 
valid. An empirical criterion, which has been used successfully in this work, 
is to set a lower bound on the gradient of the objective function taken over one 
hill-climbing step. If the change in objective function falls below this bound, 
then switching occurs and the feedback is reduced ready for the next iteration 
stage. Variation in the magnitude of this lower bound provides the control in 
switching level referred to above. 


The Algorithm 

The i-th stage of iteration can be expressed by the following 
relationships [4]s 


1. Canonical Equations; 


1 25 + (x,p) 

A X 

£ = + c p 1 ^w p (£*£) 


(u) 


6 



where C , C are diagonal coefficient matrices; w (x, p) is an n~ vector chosen 
x p — x — - 

such that e ( t,) — 0 ; w ( x, p) is an n-vector chosen such that e ( t,) — 0 ; 

—x f — p — — — p f 

e x (tp is the error in satisfying boundary conditions on x(t^), and e (t^) 
is the error in satisfying boundary conditions onjj(tp. 

2. Algorithm for Computing p*^(t 0 ): The term p*^(t 0 ) is the 
value of the co- state initial vector which minimizes J[ jc( t^) 1 at the i-th stage, 




( 12 ) 


where A p^ ^ is the incremental change in the j-th approximation to 
^ > here given bj' k 9 J^^/9 ^ ( t 0 ) ( steepest descent) ; 

produces a change in J of AJ^*^ . 

3. Feedback Switching: 

a. Decrease in feedback: if 


AJ 


(ilj) 


Ap 


m 




(13) 


where e is a preset lower bound, then 


o ( i+D = „(i) c (0 


(14) 


where < 1 determines the reduction in contribution of w(x,p) for the 

( i + l) th stage, 

b. Increase in feedback: if 


Jfx(t f )] = 


max 


(15) 
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then 


C< 1+1 > * C^) (16) 

where J is an upper bound imposed by saturation of the analog variables, 
max 

Hybrid Implementation 

Program Structure . The basic structure of the hybrid program, written 
to implement the algorithm described in the previous section, is shown in 
Figure 1. The various segments are described as follows. 

Main program EXEC is used to set up the hybrid interface, initialize 
the problem, and monitor the progress of the overall iteration. Subroutine FN 
controls the objective function evaluation and makes logical decisions as to 
feedback changing, gradient evaluation, parameter updating, and convergence. 
Subroutine INT controls the mode of the analog computer to integrate the 
canonical equations from given initial conditions and returns a value of objec- 
tive function. Subroutine GBPRT calculates first and second derivatives of the 
objective function with respect to the parameters (co-state initial conditions) . 
Subroutine SD controls the steepest descent updating of p( t 0 ) and the step size 
used in equation ( 12) . Subroutine FDBK controls the changes in feedback and 
the level used. 

A flow chart of the hybrid program is given in Figure 2; the parameters 
are listed as follows: 

n 

Boundary error V e.(t ) 2 
i=l 1 

Minimum BE, used as a stopping function for the overall 
iteration 

Current feedback level, ^ 

Current value of objective function 
Lowest value of VN stored 

Upper bound on VN, Indicating analog saturation 

Lower bound on VN, used to accelerate convergence at each 
iteration stage 


BE 


BE . 
mm 

FB 

VN 

VS 

V 

max 

V . 
mm 
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GRADV Change in VN with respect to parameters recorded over one 
iterative step 

e Lower bound on GRADV . 

Computer Assignment . The canonical equations ( ll) are programmed 
on the analog computer { 68 0) . Integration of the equations is initiated by 
setting a sense line from the digital computer ( 640) . The period of integration 
is controlled by means of a counter on the analog logic patch board; the period 
is under program control since the counter is loaded from the 640 by setting 
sense lines. The state and co-state variables, the objective function, and the 
boundary-error terms are all calculated on the 680 and transferred to the 640 
through the ADC. 

The state and co-state initial conditions, the feedback coefficients, (and 
the Contour Modification weighting coefficients when appropriate) are calculated 
on the 640 and transferred to the 680 through the DAC* s and MDAC’ s. Various 
logical operations are made on the 680 to synchronize analog mode control with 
data transfer across the hybrid interface. All other operations are performed 
on the 640. 


Application 

The hybrid program, described in the previous section, has been applied 
to two second-order systems which were studied previously ( see Appendix A) . 
The same convergence problems as before were encountered at the final stage. 

Adjustment trajectories for the system with terminal cost ( System 1) 
are shown in Figure 3a. These can be compared with those obtained using the 
manual-iterative analog version of the algorithm, shown in Figure 3b [5]. The 
need to improve the final convergence is clearly indicated in both cases. The 
method used in the previous study was Contour Modification, which is discussed 
in the next section. 

Because the problem is much worse for more sensitive systems, such 
as System II, no further comparative results are available for application of 
the basic algorithm. 
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CONTOUR MODIFICATION 


Introduction 

The need for Contour Modification and the means for achieving it have 
been noted [5] . However, the necessity of using the system transition matrix, 
in solving for the required weighting coefficients a, makes the technique 
impractical for general use. 

An attempt to develop a strictly numerical approach and its imple- 
mentation on the hybrid computer are described in the following subsections. 
The idea behind this technique is that as the overall iteration progresses, 
much information about the topology of the objective function surface is gen- 
erated through repeated gradient calculations. It would appear reasonable 
that, given the means for achieving Contour Modification, use could be made 
of this topological information in choosing the necessary weighting coefficients. 
The concept is therefore one of a hill-climber operating on a "hill” which is 
constantly evolving towards an optimum shape, namely hyper spherical. 


Criterion Function E 

The orthogonality and eccentricity minimization criteria that were 
used for analytic calculation of the optimum weighting coefficients [ 5] are 
equivalent to the single criterion that the augmented objective function shall 
have eigenvalues which are all equal in magnitude. This condition can be 
recognized by examining the matrix of second derivatives of the objective 
function. When the off-diagonal terms are all zero and the diagonal terms all 
equal, the objective function will be hyperspherical. Thus the optimum 
weighting coefficients can be determined from the minimization of a criterion 
function based on the second derivative matrix. 

Let 


H = [h.j] ; i, j = 1, ... n 


be the matrix of second derivatives of the augmented objective function Jm 
with respect to p( t 0 ) . Then a suitable criterion function is given by 

E = + £2 (17) 
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where 


n-1 



( 18 ) 


E2 


n n 

l Z 


i=l j=l 


1 

2 


h..h . 

»j n 


1*1 


(19) 


Generation of E should be well suited to the hybrid computer in that 
the first and second derivatives can be determined by a finite difference 
method [8] involving repreated solution of the state and co-state equations. 
This method would seem preferable to that of using influence coefficients [ 9l 
to determine the gradients, with the attendant large increase in the number of 
equations to be programmed. 

Unfortunately, E itself is extremely sensitive to the values of a, with 
the result that implementation on the hybrid computer has not been very 
successful. However, experiments with all digital computations to obtain a* 
through minimization of E have been performed successfully. 


Minimization of E 

The choice of algorithm for minimizing E is dictated primarily by 
the way in which E is calculated. When H and hence E are determined 
by a finite-difference method, no analytic expression is available for comput- 
ing the gradient of E with respect to a . Grad E would itself have to be 
determined by a finite-difference method, leading to a very rapid increase in 
the number of function evaluations through solution of the system of canonical 
equations. 

One computation of E takes (2n 2 + l) equation solutions [8]. One 
computation of Grad E takes 2n evaluations of E , thus 2n( 2n 2 + l) equa- 
tion solutions. 

If influence coefficients are used to determine the gradients of the 
objective function, then an analytic expression for Grad E can be determined 
( see Appendix B) . The number of function evaluations is reduced substan- 
tially, at the expense of 2n 2 additional equations to program. 


11 



It is preferable to use a direct-search type of algorithm, which does 
not require gradients, for minimizing £ . The sensitivity of E , mentioned 
above, poses a problem for this type of search. However, various sophisti- 
cated algorithms have been developed, among them Zangwill’ s modification 
[ 10] of Powell* s method f 11] , generally considered to be one of the most 
effective techniques [ 12] . This type of search requires repeated linear 
minimizations which can be performed in a variety of ways, among them 
cubic interpolation, quadratic approximation, and Golden Section search [12], 
The latter method was used for the hybrid formulation because of its basic 
simplicity; although requiring more function evaluations, it is less sensitive 
to errors in data transfer across the hybrid interface. 


Implementation 

Digital Computation . Various algorithms have been applied to the 
problem of finding _ar* for System I. The results are presented in Table 1. 
The label "formula” refers to E being evaluated through the solution of the 
state transition matrix, thereby allowing gradient calculation in a closed 
form. "Analytic'* refers to a closed-form solution of the optimization prob- 
lem for a — possible for this simple case. In all other iterations, E was 
obtained through a finite-difference evaluation of H , 

Although Zangwill’ s algorithm used with GOLDl required more than 
twice as many function evaluations as with QUADF, the simplicity of the 
method is generally more reliable, as evidenced by the results shown for 
System II. Although only second order, this system displays an objective 
function V and a criterion function E which are extremely sensitive to their 
respective parameters. 

Hybrid Computation . The results of attempting a hybrid minimization 
of E are presented in Table 2. Convergence is erratic, the principal source 
of trouble being the evaluation of E . Computing H via finite differences, 
and then E from the elements of H , involves several subtractions of 
nominally similar quantities. The accuracy of the A-D and D-A converters 
in the hybrid interface is such that these subtractions can introduce large 
errors into the iteration. 


It was hoped to combine the iteration on a with the feedback iteration 
on p( t 0 ) , described earlier. This would have allowed an evaluation of the 
"adaptive hill" concept discussed in the introduction to this section. How- 
ever, the memory capacity of the EAI 640 computer is too small for the two 
programs to be loaded together so the doubie iteration could not be attempted. 
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TABLE 1. COMPUTATION OF ct* (DIGITAL) 


Algorithm 

Iterations 

Function 

Evaluations 

Y . 
min 

tv,* 

ro 

* 

System I 

Z/QUADF 

4 

44 

0. 0219 

2. 239 

-1.646 

Z/GOLD1 

3 

94 

0. 01925 

2.223 

-1.632 

CONJ/GOLD1 
( formula) 

11 

217 

0. 00787 

2. 225 

-1.634 

CONJ /CUBIC 
( formula) 

4 

38 

0.00028 

2.232 

-1.641 

Z/GOLD1 
( formula) 

3 

94 

0. 01926 

2.223 

-1.632 

Z/QUADF 
( formula) 

4 

40 

0, 0998 

2.229 

- 1. 634 

HYBRID 

3 

111 

1.2500 

2.017 

- 1. 463 

ANALYTIC 


| 


2.230 

-1.640 

System II 

Z/G.OLD1 

10 

548 

0. 000 

19.39 

-2. 209 

Z/QUADF 


No Convei 

rgence 



ANALYTIC 




19.90 
... ■ 

-2. 275 


Key: Z — ZangwilT s Direct Search 

CONJ — Conjugate Gradient Search 
QUADF — Linear Search with Quadratic Fit 
CUBIC — Linear Search with Cubic Fit 
GOLD1 — Linear Search with Golden Sections 
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TABLE 2. COMPUTATION OF 0*= ( HYBRID) 


Criterion 

Function 

(Y) 

Iterations 

Function 

Evaluations 

Y . 
mm 

Of j* 

-- - 

a 2 * 

E 

6 

i 

188 

0.5820 

2. 168 

-1.628 

E^ 

3 

111 

1.2500 

2. 017 

-1.463 

E Vz 

2 i 

153 

6.442 

2.362 

-1.625 


Nevertheless, the basic algorithm was used with optimum values of 
a computed in the all-digital program. Adjustment trajectories for System 
I are shown in Figure 4, Comparison with Figure 3a shows the effectiveness 
of Contour Modification in improving the overall convergence. 

A hand-plot of the progress of the iteration for System II is given in 
Figure 5; an automatic plot of the same adjustment trajectory proved to be 
too crowded to be meaningful. 


Discussion 

The results presented in the last subsection indicate that a criterion 
function for choosing the optimum weighting coefficients, to achieve Contour 
Modification, can be minimized through a direct-search type of algorithm. 
However, the features of the criterion function and its evaluation on the 
hybrid computer are such that inconclusive results have been obtained as to 
its usefulness. It is not clear at present what alternative criterion could be 
used that might lend itself better to minimization on the hybrid computer. 


PROGRESSIVE UPDATING 
Introduction 

The techniques described in the sections on Feedback Switching and 
Contour Modification have been developed to enable solutions to trajectory 
optimization problems to be obtained on a hybrid computer. Optimal guidance 
is essentially a trajectory optimization problem so that nominal guidance laws 
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can be computed. However, as noted previously, optimal solutions are gen- 
erally very sensitive and large errors can appear toward the end of a trajec- 
tory interval, as a result of the limited accuracy obtainable from a hybrid 
computation. 

A Progressive Computation or Progressive Updating method has been 
developed to overcome the error buildup [6]. Not only is the nominal guidance 
law corrected as the flight progresses but also the features for on-line guid- 
ance are contained in the method. Since the trajectory optimization problem 
is repeatedly solved for diminishing intervals, changes in the required end 
conditions or disturbances to the trajectory are easily accounted for. A 
description of the method follows and experimental results are presented. 


Method 

Solution of the trajectory optimization problem has been obtained in 
terms of the initial condition vector for the co-state variables. To recon- 
struct the optimal control function in real time, a simulation of the system 
co-state is required, to be used in conjunction with the measured (or esti- 
mated) system state vector. The interconnection of a high-speed computer 
model and real-time hardware (or plant) was suggested by Paiewonsky [ 13] 
and is shoWn in Figure 6. 

The initial high-speed computation is carried out over the complete 
control interval (t^~t^) an d jrt(t^) is transferred to the real-time co-state 

simulation. After a given time interval, r , has elapsed, the plant state and 
co-state are sampled and transferred to the model. A new high-speed com- 
putation is then started over the reduced control interval t^ - (t^ + r) with 

x(t Q + t) as the initial state vector and ^(t Q + r) as the first guess at the 

required optimal vector £*( + r) . When obtained, the new optimal initial 

co-state vector for the control interval t f - (t Q + t) is transferred to the 

real-time co-state integrators and the plant trajectory proceeds from 
x(t Q + t) and £*(t^ + t) • The procedure Is repeated at regular intervals t 

throughout the overall control interval (t f - t ) and thus the control is con- 
tinually updated, ensuring a nearly optimum trajectory over the remaining 
time. 


The time taken to perform the high-speed computation has been 
neglected in the preceding description but can be allowed for by using the 
model to predict the state of the plant after the computation time. At . In 
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this way, for the r-th computation, p*(t 0 + rr + At) is computed on the 
model instead of p*( t 0 + rr) , relying on the fact that x( t 0 + rr + At) , as 
measured on the model, should be a close approximation to x( t 0 + rr + At) 
on the plant. 


Hybrid Simulation 

The updating scheme described above was simulated on the hybrid 
computer, although the allowance for computing time was not included. This 
omission greatly simplified the timing and synchronization of the simulation, 
which is described by the following steps: 

1. From the given initial state x( t 0 ) and the guessed initial co-state 

/ 

p( to) , the approximation to the optimal initial co- state p*' '(t 0 ) is com- 

puted on the model. Predetermined weighting coefficients a( t 0 ) are used. 

2. The plant equations are integrated over an interval from initial 

( 

conditions x( t 0 ) , p*' ( t 0 ) ; x( t 0 + r ) , p( t 0 + t) and the cost incurred 

x 0 ( t 0 + t) are measured from the plant. 

3. From the measured initial state x( t 0 + r) and the measured 
initial co-state p( t 0 + t) , the approximation to the optimal initial co-state 

( ml 

' ( t„ + t) is computed On the model. Predetermined weighting coeffi- 
cients n( t 0 + r) are used. 

4. Replacing by t^, r»l,2, ... , (q- l), where 


t 

r 


r-1 


+ t 


q 


<*r 




y 


steps 2 through 4 are repeated until r = q - 1 . 

5. With r = q - 1 and hence t „ = t tf , the final state x(tj , final 

r+1 i — f 

co-state p(t^) , and final cost incurred x^(t^) are measured from the plant. 
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6. The total cost incurred by the plant over the complete control 
interval, i. e. , the performance index J , is given by 



K[x(t f ) ] 



( 20 ) 


where 



Both model and plant were simulated on the analog portion of the 
hybrid computer. Time scales were under digital control, with the model 
being run one thousand times faster than the plant. 

Because of the difficulty of implementing the computation of a on the 
hybrid computer, the weighting coefficients were computed for the various 
time intervals in an all-digital program and input as data to the hybrid 
program. 


Experimental Results 

To assess the effectiveness of progressive updating, the method 
should be applied to a sensitive system, in other words, one in which the 

trajectory computed over the complete control interval from p*^ m ^( t # ) 
diverges from the true optimum trajectory. 

Comparative trajectories for System I are shown in Figure 7, 

Clearly, the single trajectories computed from p*^ m ^(to) (as obtained in 
Figure 4) are close to the optimum trajectories. ( The latter were obtained 
from an all-digital solution as in Reference 6) . Therefore, there is nothing 
to be gained from using Progressive Updating with this system. 
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However, this is not the case for System II. Comparative trajectories 
are shown in Figure 8 and divergence from the optimum is evident. The 
third trajectories in Figure 8 were obtained through application of the Pro- 
gressive Updating scheme; strip-chart recordings of the same trajectories 
are shown in Figure 9. The effect of Progressive Updating in driving the 
state and co- state trajectories towards the optimum is clearly seen. 

Comparative values of the performance index for the trajectories of 
Figure 8 are 

Single computation: J - 15. 08 
Progressive updating: J = 4. 03 
Digital ( optimum) computation: J = 3. 04 

These values were obtained from equation ( 20) , Table 3 gives the values 
of a used at each updating stage. 


TABLE 3. a* USED FOR EACH STAGE OF 
PROGRESSIVE UPDATING 


Updating 

Stage 

T 

a 

0 

( 

19.380 

-2.210 ) 

1 

( 

12. 510 

-1.232 ) 

2 

( 

7.4 85 

-0. 560 ) 

3 

( 

3.980 

-0.147 ) 

4 

{ 

1 . 682 

-0. 047 ) 

5 

( 

0. 518 

0.081 ) 


The principal on which Progressive Updating is founded is that tra- 
jectories generated from a hybrid solution ( inherently having some error) 
will be most nearly optimal during the early part of the trajectory. There- 
fore, the more often updating can occur, the more accurate will be the over- 
all trajectory. The faster a solution can be obtained, the more often updating 
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can occur. Consequently, on-board guidance updating requires as fast a 
computing method as possible, hence the consideration of a hybrid mode. 
The net result is a classic tradeoff between speed of computing and the 
accuracy of an individual solution. 


CONCLUSIONS 


On-line solutions of trajectory optimization problems, for use in real- 
time guidance updating schemes, require a high-speed iterative computation. 
One approach is to exploit the rapid integration capability of a hybrid com- 
puter. Certain computational difficulties arise, however, because of the 
limited precision and amplitude range on an analog computer. Possible 
methods for overcoming these difficulties have been studied in this report 
with varying degrees of success. 

Solution of the optimization problem is obtained in terms of the initial 
co-state vector. Convergence from an arbitrary initial guess is uncertain, 
due largely to the inherent instability in the canonical equations, causing 
saturation of analog variables. A feedback technique has been developed to 
force the state and co- state variables toward satisfying given boundary con- 
ditions, thereby maintaining stability in the trajectories. Although basically 
empirical in its implementation, the technique has been successfully applied 
to simple dynamical systems, simulated on the hybrid computer. 

Final convergence of the feedback technique is, however, subject to 
the difficulties mentioned above since feedback is removed over the last 
stage. The known final boundary conditions have been used to stabilize the 
final convergence by augmenting the objective function in the solution space. 
The criterion used for such objective function contour modification has been 
to diagonalize the matrix of second derivatives, generating eigenvalues of 
equal magnitude. 

The appropriate weighting coefficients for modification have been 
successfully calculated with an all-digital program. A similar hybrid 
program, however, was not successful in generating a stable solution. The 
extreme sensitivity of the criterion function itself has been identified as con- 
tributing to this difficulty, together with inherent noise in the analog and 
linkage portions of the computer. 
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A simulation of progressive control function updating was performed, 
using weighting coefficients calculated digitally. Both a high-speed model and 
a simulated real-time plant were programmed on the analog computer. 
Experimental results have shown that Improved trajectory tracking can be 
obtained by regularly updating the optimal solution. The initial solution, 
computed by the feedback technique with contour modification, provides an 
appropriate control function over the initial stages of the trajectory. The 
tendency for this primary solution to be in error as the trajectory progresses 
is corrected by the updating scheme. 

The hybrid computer has been shown to be useful in generating on-line 
optimal guidance functions. However, the stability and speed of the primary 
solution remains an area for further study. 
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APPENDIX A 


SYSTEM EQUATIONS 


The canonical equations, boundary conditions, and performance indices 
for the systems on which the algorithms were tested are given below: 

1. System with Terminal Cost: 


J - x 2 ( t,) + x 2 (t.) + / 5 u 2 dt 
It 2 £ / 

E 0 




"l. o' 


2 x t ( t f ) 

2<‘o> * 


; p( t f ) 3 



o 

1 


2 x 2 ( t f ) 


2. Lag plus Integrator System: 




1 

o 

1 


i — 

o 

t 

O 

11 


II 


. 

— 1 
o 


_o_ 
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APPENDIX B 


ANALYTIC DETERMINATION OF GRAD E 


The components of H = (h ] i, j = 1, ... n, can be evaluated from 

successive measurements of the first derivatives m., i= 1, ... n. 

1 

( s) 

At a nominal point s , measure m v ' , r = 1, . . . n. At successive 

(q) r 

points q, measure m , r = q, . . . n; q = 1, . . . n where points q are 

r 

obtained from the nominal point s by perturbing P q ( t 0 ) by an amount Ap^(t Q ) 
and holding all other components of £(t Q ) fixed* Then, 


h.. * 


( i) ( s) 

m. - m, 
i l 


ii 


i = 1, ... n 


h.. - 
ij 


1 1 

2 Ap.{t 0 ) 


i > j 5 i — l,...n 
j — 1, ... n 


= h.. 

n 


n- 1 n n 

e - y) (h.. - h ) + y y — (h..h..) 

n l+l, l +1 .H. A* 2 11 11 

1=1 i=l j=l J J 


The derivative of E with respect to a is required: 


a e 

9a, 


n-1 


2 J ^ h ii “ h i+l,i+l^ 

i=l 


9 h.. 9h 

ii i+1, i+1 

9a. 9a, 

k k 


n . / 9h.. 9h.,\ 

^ + h i‘ %) ' 


(B-l) 
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Now m can be expressed in terms of influence coefficients [9J, 
from additional differential equations! 


m 


, (i) - i 

r=l 


y( j) + Z 

x ^ri p ri 
t r 


where 


y - (t) " 9pJO 


TV 


9x (t) 
r 

7 *? 


8P r (t) 

Z rl (t) = 


V° ■ 


■ V 


5 V 


P r 3P r ( V 


Writing 


A m. 


«> - m .< *.„,<*> 
1 1 



evaluated 
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then 



Substituting equation ( B-2) into equation ( B-l) allows 9 E/9a^ 

ated, since A,, and AV are functions of a . 

V P 


(B-2) 

to be evalu- 
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Figure 1. Hybrid program structure. 
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CALL FOBK: 
REDUCE FB 


FB = 0? . 

NO 


X IS 
BE<BE n 


/ HAS \ 
^/STABLE THAJ-\ 
ECTORY BEEN RUN 
s s s WlTH CURRENT^ 
\ FB? / 


IS 

VN < VS? 


CONVERGED 


RETURN 
' EXEC 


CALL GRPRT: 

EVALUATE 

GRADIENTS 


CALL SO: 
REDUCE 
STEPSIZE 


^ IS \ 

GRADV < e 

V 1 X 


CALL GRPRT: 



EVALUATE 


CALL SD: 
UPDATE £.10) 

GRADIENTS 

1 

CALL FOBK: 
REDUCE FB 


CALL SD: 
INCREASE 
STEPSIZE; 
UPDATE p<0) 


RETURN 

EXEC 


RETURN 

EXEC 


RETURN 

EXEC 


Figure 2. Flow chart of hybrid program. 
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Figure 4. Adjustment trajectories for System I 
(Countour Modification). 
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Figure 5. Adjustment trajectories for System H 
(hand plot) . 




Figure 6. Progressive Updating model 
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a. State trajectories. 


Figure 7. Comparative trajectories for System I 
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b. Co-state trajectories. 

Figure 7. Comparative trajectories for System I. 
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a. State trajectories. 

Figure 8. Comparative trajectories for System n. 
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b. Co-state trajectories. 


Figure 8. Comparative trajectories for System II 
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Figure 9. Strip chart for System II 
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b. Progressive trajectories. 


Figure 9. Strip chart for System II, 
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